REMARKS 

By this amendment, Applicants have amended Claims 18 and 19 and have 
canceled the remaining claims without prejudice. In addition, new dependent claims 20- 
23 are added which reflect the subject matter of the previous dependent claims. For 
reasons as stated below, Applicants submit that the present amendments overcome all 
outstanding rejections and place this case in condition for allowance. 

In the Official Action, the Examiner objected to Claim 4, but this objection has 
become moot by virtue of the cancellation of that claim. In addition, the Examiner 
rejected the claims under 35 U.S.C. § 112 on the basis of failure to comply with the 
written description requirement. This rejection, insofar as applied to the claims as 
amended, is respectfully traversed. In particular, the claims as amended are no 
directed to antibodies recognizing amino acids 51-598 of the SdrG protein, which is also 
known as the ligand binding region A, A region or A domain. The reference to the A 
region would thus be well understood by one skilled in the art of MSCRAMM® proteins 
(see, e.g., attached Ponnuraj et al. article, p. 217, right column, reference to "N terminal 
A regions"), and thus their inclusion in the dependent claims is entirely appropriate. 
Claim 18 is now directed to an antibody recognizing this protein as encoded by its 
nucleic acid sequence, and Claim 19 is now directed to the amino acid sequence 
recognized by the antibody. This subject matter is clearly disclosed in the application, 
as was explained in detail previously during this prosecution, and Is described in the 
specification and in Figure 5B. The written description of this precise sequence would 
be well understood by one skilled in the art who would readily be able to generate the 
claimed antibodies. 
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Applicants thus submit that the claims as amended are in complete compliance 
with Section 112, first paragraph, and that this rejection should be withdrawn. 

In the Official Action, the Examiner also rejected the claims under 35 U.S.C. § 
112 on the basis of indefiniteness. In one aspect of the rejection, the Examiner 
objected to the reference to means for identifying binding of an antibody. Applicants 
submit that this objection is improper in that diagnostic kits are well known and 
understood in the art, and that anyone in this field would instantly understand what 
means for identifying binding would be. Information regarding diagnostic kits is 
provided at pages 39-40 of the specification, and the means therein are conventional 
and well known. Applicants have thus provided sufficient disclosure to support such 
means in the specification, and it is well established that the disclosure need not 
include, and preferably omits, information known to those of ordinary skill in the art in 
order to meet the written description requirement. See, e.g., Monsanto Co. v. Scmggs, 
459 F.3d 1328, 1336-37 (Fed. Cir. 2006). Applicants submit that the present claims are 
proper under Section 112 and that the Examiner's objections are respectfully traversed. 

In the Official Action, the Examiner also rejected the claims under 35 U.S.C. 
§1 02(b) as being anticipated by US Patent No. 6,380,370 to Doucette-Stamm et al. 
However, the Examiner only alleged that a "protein" was used to raise antibodies, and 
not particular portions of the full protein as in the case of the present claims. Clearly, 
the Doucette-Stamm reference did not disclose or suggest the A domain of the SdrG 
protein which is recognized by the antibodies of the present claims, and thus Doucette- 
Stamm does not anticipate or make obvious the present claims, for reasons explained 
in more detail below. 
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As was pointed out in prior responses, tine Doucette-Stamm patent is only 
directed to a disclosure of predicted nucleic acid and amino acid sequences, and refers 
to certain proteins that appear to be encoded by those sequences. As such, the 
reference at most discloses whole proteins, and does not have any disclosure or 
suggestion of specific portions of such proteins, such as the A domain. Accordingly, 
even assuming arguendo, that any proteins were actually expressed and antibodies 
were raised thereto, there is no disclosure whatsoever of antibodies to a specific region 
or portion within a complete protein. As such, this reference does not disclose or 
suggest any particular region within a sequence, much less the specific A domain 
sequence of the amended claims and antibodies thereto. Even further, there is no 
disclosure or suggestion that the actual protein may have been an extracellular matrix 
protein, and the lack of such information would make it impossible for one to have 
predicted or deduced any particular regions from the sequenced proteins and 
polypeptides. 

As a result, the disclosures of the polypeptide sequences in the cited references 
do not disclose or make obvious the present claims which are directed to antibodies 
capable of binding to a sequence consisting of amino acids 51-598 of SEQ ID NO: 10. 
This region, also known as the A domain, is nowhere disclosed or suggested in the 
cited prior art references, much less the generation of antibodies that are capable of 
recognizing this specific region. In short, the cited reference has no disclosure or 
suggestion of the present claims, and one would not have been able to arrive at the 
present invention using the cited reference which did not disclose or suggest the 
specific A domain within the polypeptides sequenced, much less antibodies thereto 
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Accordingly, Applicants respectfully submit that the present invention is not 
anticipated nor made obvious by the prior art, and that the claims of the present 
application relating to antibodies capable of binding an amino acid sequence consisting 
of amino acids 51-598 of SEQ ID NO: 10 are clearly patentable over those references. 

Applicants thus submit that the Examiner's rejection on the basis of the prior art 
is respectfully traversed and should be withdrawn. 

In light of the amendments and arguments, set forth in detail above. Applicants 
submit that the present application overcomes all prior rejections and has been placed 
in condition for allowance. Such action is respectfully requested. 



By: B. Aaron Schulman 
Registration No.: 31,877 
STITES & HARBISON PLLC*1 199 North Fairfax St*Suite 900»Alexandria, VA 22314 
Tel: 703-739-4900 ♦ Fax: 703-739-9577 ♦ Customer No. 000881 




Respectfully submitted, 



Date: November 12, 2009 
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Summary 

Gram-positive pathogens such as staphylococci con- 
tain multiple cell wall-anchored proteins that serve as 
an interface between the microbe and its environment. 
Some of these proteins act as adhesins and mediate 
bacterial attachment to host tissues. SdrG is a cell 
wall-anchored adhesin from Staphylococcus epider- 
midis that binds to the Bp chain of human fibrinogen 
(Fg) and is necessary and sufficient for bacterial at- 
tachment to Fg-coated biomaterials. Here, we present 
the crystd structures of the llgand binding region of 
SdrG as an apoprotein and in complex with a synthetic 
peptide analogous to its binding site in Fg. Analysis 
of the crystal structures, along with mutational studies 
of both the protein and of the peptide, reveals that 
SdrG binds to its ligand with a dynamic "dock, lock, 
and latch" mechanism. We propose that this mecha- 
nism represents a general mode of ligand binding for 
structurally related cell wall-anchored proteins of 
gram-positive bacteria. 

Introduction 

Staphylococcus ^idermidis is a msgor compmient of 
the normal skin flora, cotonizing the healthy human epi- 
dermis and mucous membranes. However, S. epider- 
midis has evolved into one of the leading causes of 
nosocomial sepsis; this opportunistic pathogen is now 
the most common organism isolated from nosocomial 
primary blood stream infections (BSI) (Richards et ai., 
2000). Alarmingly, antibiotic resistance is emerging 
among S. epidermidis strains and other related gram- 
positive opportunistic pathogens such as S. aureus and 
E U^catts. The emergence of these multiple antibiotk:- 
resistant strains constitutes an imminent publk: health 
crisis. Therefore, we need a better imderstanding of 
the pathogenic molecular strategies employed by these 
organisms to develop new antimicrobial drugs and treat- 
ments. 

Microbial attachment to the host tissues represents 
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the first critical step in the establishment of most bacte- 
rial infections. This process involves adhesins on the 
surface of the mterobe that bind to ligands in the host 
in "lock and key" interactions. For pathogenic bacteria 
that do not invade host cells, extracellular matrix pro- 
teins are preferred targets for bacterial adhesion. On 
gram-positive bacteria such as Staphylococci, Strepto- 
cocci, and Enterococci, tiie adhesins mediating these 
interactions have been tenned MSCRAMMs (mterobiai 
surfece components recognizing adhesive matrix mole- 
cules) (Patti and Hook, 1 994). Multiple MSCRAMMs have 
been found in many pathogenic gram-positive species, 
and their amino acid sequence relatedness, similar mod- 
ular design, and common binding domain organization 
suggest that they might have arisen from a common 
ancestor. In general, these stnictuialy related MSCRAMMs 
contain an N-terminal ~40 amino acid long signal se- 
quence (S), and C-terminal features that are required 
for sorting the proteins to the cell wall including a pro- 
line-rich wall-spanning region (W), the wall anchoring 
LPXTG motif, a hydrophobic transmembrane region (M), 
and a positively charged cytoplasmic tail (C). In addition 
to these C-termlnal features, the members of the Sdr 
family of proteins, including the fibrinogen (Fg) binding 
MSCRAMMs ClfA and ClfB of S. aureus, and SdrG of 
S. epidermidis contain a Ser-Asp repeat region (R re- 
gion) just outside the cell wall-anchoring region (see the 
SdrG prototype in Figure 1 A). The ligand binding activity 
of most of these MSCI=<AMMs has been localized to the 
N-terminal A regions that are ~500 amino ackl long 
(reviewed by Foster and Hook, 1998). In the case of 
SdrG, the ligand binding site has been further localized 
to residues 273-597 (Davis et ai., 2001). 

Fg appears to be targeted by many pathogens, and 
staphylococci can express several different proteins 
that bind to this host component. For instance, the S. 
aureus MSCRAMMs ClfA, ClfB, FnbpA, and FnbpB all 
recognize Fg. In addition, this organism can secrete 
several Fg binding proteins including Map/Eap (Jonsson 
et al., 1995, Paima et ai., 1999), Efb (Paima et ai., 2001), 
and Emp (Hussain et ai., 2001). During tissue repair, Fg 
not only controls blood loss by clotting, but it is also 
part of the host defense against microbial infections. 
In damaged tissues, the proteolysis of Fg by thrombin 
releases potent chemotactic fragments such as fibrino- 
peptide B, attracting leucocytes and fibroblasts to the 
site of injury (Kay et ai., 1974; Richardson et al., 1976; 
Seraor et al., 1 986; Skogen et al., 1 988). We have shown 
previously that SdrG's binding site in the Fg Bp chain 
overiaps the thrombin cleavage site and that SdrG can 
interfere with fibrin clot fonnation by inhibiting the 
thrombin mediated release of fibrinopeptide B (Davis 
et al., 2001). It is possible that staphylococci express 
proteins that bind to Fg to prevent the release of chemo- 
tactic elements, thus reducing the influx of phagocytic 
neutrophils to the infection site and enhancing the bac- 
terial survival in the host. 

In this report, we present the crystal structure of the 
Fg binding domain of SdrG, both as an apoprotein and 
in complex with a synthetic peptide analogous to the 




Figure 1. Representation of the iSdrGStsHnrPeptide Complex 

(A) SchemaUc representation of the SdrG molecule (Top): A, N-terminal Fg binding region; N2-N3, subdomains crystallized in the rSdrG|27KS7)- 
peptide complex; B1 and B2, repeats of unknown function; R, serine-aspartate repeat region; W, wall-spanning region; M, membrane-spanning 
region; C, cytoplasmic positivaly charged tall. The amino acid residue number Identifying the boundary between each subdomain is Indicated 
below. The Fg (pe-219 peptide sequence (BelowO. The anow indicates the thrombin cleavage site. 

(B) Stereo ribbon (Carson, 1997) representation of rSdrC«,uR)-F0 (P6-2Q) peptide analog complex. The peptide Is shown m the "bail and 

(C) Schematic representation of the topology of the iSdiQ^nan fold. The N2 and N3 domains are shown in green and yellow, respectively. 
The dotted Ibie at the end of the G' strand indicates the C tenninus of the apo-rSdr(V^ structure. In the rS(M^^ 

C-temiinal strand inserts between strands D and E of the N2 domain. 



binding site in its ligand, human Fg. Using a combination 
of site-directed mutagenesis in SdrGfj/s^ and amino 
acid replacements in ttie Fg (p6-20) peptide, we liave 
developed and examined a model for the mechanism 
of the MSCRAMM-ligand interaction. In addition, our 
analyses suggest that this mechanism is a common 
mode of ligand binding among structurally related 
MSCRAMMs from gram-positive bacteria 

Results and Discussion 

Structure of the SdrGpn.«M,-Fg Peptide Complex 
Initial characterization of the recombinant A-region of 
SdrG revealed that this protein is readily processed at 
the N terminus by a contaminating protease, yielding a 
resistant fragment con^sponding to residues 273-597 
(Rgure 1 A). Similar processing was previously observed 
in vivo for the A-region of ClfB (McAleese et al., 2001) 
and in vitro for the A-region of ClfA (Deivanayagam et 
al., 1999). We therefore reengineered a plasmid that 



would generate a recombinant fusion protein covering 
residues 273-597 of SdrG with an N-terminal His-tag for 
purification purposes. This recombinant SdrG^jTs^ was 
crystallized in complex with its ligand, a synthetic pep- 
tide designed to contain residues 6^ of the human Fg 
p-chain (Rgure 1A). The structure of the rSdrG(27fr^- 
peptide complex was determined by multiple isomor- 
phic replacement methods. The solvent flattened elec- 
tron density map was readily interpretable and allowed 
a model for the structure of the rSdrG(276^9e)-peptide 
complex to be built. The structure has been refined to 
an R ^or of 20.9% (Rim of 22.1 %) against diffraction 
data extending to 1 .86 A resc^ution (Table 1). There are 
two independent molecules in the ciystaliographic 
asymmetric unit of a P2|2|2| cell and they exhibit an 
identical structure with an rms deviation of 0.2 A on all 
Ca atoms upon superposition. 

The folding of the rSdrG,;?;^,;) molecule with a bound 
peptide is illustrated in Rgure 1 B. No electron density 
was observed for the 20 N-terminal residues, which in- 
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Table 1. Peptide Bound and Free rSdrGem«„ Ciystallographlc 
and Refinement Data 



Parameter Peptide Bound Peptide Free 


a (A) 


89.02 


44.39 




89.48 


92.76 


c(A) 


98.04 


93.92 


«0 


90 


83.62 


PO 


90 


76.06 


y<n 


90 


90.57 


Space group 


P2,2,2, 


PI 


ResohJtian (/^ 


1.86 


3.5 


Reflections total/unique 


549,956/63,746 


57,863/13,739 


Completeness (%) 


9S.7 


97.3 


R^' 


6.1 


8.4 


Numt>er of molecules In the 
asymmetric unit 


2 


4 


Refinement 


Resolution range (A) 


20-1.86 


20.0-3.S 


R factor/Rf™,'" 


0.209/0.221 


0.250/0.319 


(All data with 0 sigma cutoff) 






Average B value (A') 


20.53 


30.34 


Rmsd for bonds (A) 


0.006 


0.012 


Rmsd for angles 0 


1.27 


1.51 


Numt)er of nonhydrogen 


5003 


8216 


protein atoms 






Number of peptide atoms 


170 




Number of water molecules 


438 




Number of Ca'* ions 


2 




POBcode 


1R17 


1R19 



'Rs,„=X|lh-<lh>|/Slh, where <lh> is the average intensity over 
symmetry equivalents. 

"Number of reflections set aside for calculation of the R|,„ value for 
peptide bound and unbound stmctures were 1 239 and 669, respec- 
tively^^ 



elude 12 residues contributed by the vector HiSe-tag 
sequence and residues 273-275 of the rSdrG protein. 
Ui<ewise, no electron density was observed for one resi- 
due (two residues in the second molecule of the asym- 
metric unit) of the rSdrG C tenninus. The rSdrG(276-596) 
polypeptide folds into two distinct domains; N2 and N3. 
In agreement wKh eariier temninology (Perkins et al., 
2001), the terni N1 has been reserved for the proteolyti- 
cally processed N-terminal segment of the A-rs£pon. The 
topologies of N2 and N3, with 13% residues conserved 
between them are stmcturally similar and follow the 
DEv-IgG (DEvariant-IgG) fold (Rgure 1C). The DEv-IgG 
fold, which represents a variant of the C-type immuno- 
globulin fold, was recently identified in the crystal struc- 
ture of the apo-rClfA(22,^59|f the minimum Fg binding re- 
gion of the MSCFUMM clumping factor A (ClfA) from 
S. aureus (Deivanayagam et al., 2002). The N2 and N3 
domains in the rSdrG(i76.59«)-peptide crystal structure in- 
teract with each other and have a combined buried sur- 
face area of 1 378 The N-terminal N2-domain has 1 48 
residues (276-424), and is slightly smaller than the N3 
domain, which has 170 residues (425-597). The N2 and 
N3 domains superpose with an mns deviation of 1.8 A 
for 48 common Ga atoms and 3.7 A on all Ca atoms, 
and the differences are mostly found in the length and 
conformation of the loops joining the p strands of the 
two domains. 

The two principal p sheets of the N3 domain are com- 
posed of p strands A', B', E', and D' on one face of a 



"sandwich" and strands 01', D2', G', F', and G' on the 
opposite face (Rgure 1 C). The insertion of the additional 
strands D1 ' and 02' represent the unique feature of the 
DEv-IgG fold compared to the C-type IgG fold. In the 
N2 domain, strands 01 , 02, C, F, and G form one sheet, 
the second sheet contains an additional strand G" in- 
serted between the E and the D. This Inserted G" strand 
is not fbimed by residues from the N2 domain, but in- 
stead, it is f omied by residues SerS87-Glu597 that repre- 
sent the C-tenninal extension of the N3 domain (Figure 
10). The Inserted G" strand buries a surfece area of 
1 138 A^ of the N2 domain. 

Structural Features of the SdrG Binding Site 
A cleft of approximately 30 A in length between the N2 
and N3 domains constitutes the Fg binding region of 
rSdrG(27M96). The Fg-derived peptide fits snuggly into this 
cleft in an extended conformation, interacting with both 
the N2 and the N3 domains (Rgure 1B). A well-defined 
electron density was observed for most of the residues 
of the bound peptide (Figure 2A) except for three 
N-temninal (Asn6f, Glu7f and GluSi) and two C-temiinal 
residues (Leu19f and Asp2O0. These residues are ex- 
tended into the solvent environment, having few interac- 
tions with the SdrG molecule, and with a weak electron 
density in the 2fo-fc maps. The superposition of the 
bound peptide in the two complex structures shows an 
rms deviation of 0.1 6 A (on main chain atoms) indicating 
that the confbmnatkms of tiie peptide in both structures 
are nearly identical. 

The central region of the deft is covered by rsskiues 
lle581-SerS86, a-segment corresponding to the linker 
connecting the N3 domain G' strand with strand G" 
strand of the N2 domain, giving the binding region 
a "tunnel-like" appearance (Rgure 2B). The peptide 
reskiues Phe9af-Ala13f are positioned in this tunnel, 
appearing as if the peptide is threaded through the 
rSdrGisTMge) molecule. 

An area of 1 200 A' is buried between rSdrG,276^9e, and 
the peptMe, involving 62 contacts with less than 4 A 
between the protein and the peptide ligand. For exam- 
ple, there are 8 antiparatlel backbone hydrogen bonds 
between residues PhelOf, Ser12f, Arg14f, and IHis16f in 
the peptide and Ser584, AlaS82, Thr580, and Asp578 in 
the "linker" and the N3 domain of rSdrG(276-596) (Rgure 
20). In addition, the peptide residues Glyl 5f , Alal 3f , and 
Phellf fbnn four more backbone-backbone hydrogen 
bonds with residues His304, Ala306, and Leu340 of the 
N2 domain. The backbone atom of Phellf makes a 
hydrogen bond with the side-chain of Ser338. The side 
chains of the peptide amino acids form two hydrogen 
bonds with rSdrG,„w96): (1) Arg14f (NH1) to Tyr294 (OH) 
and (2) Ser12f (OG) to Asp339 (002). Three Phe residues 
(Phe9af, PhelOf, and PhellQ in the N temiinus exhibit a 
large number of hydroi^obk; interactions with r^dues 
from the N2 and the N3 domains and appear like a "bulgy 
plug" positioned at the end of the tunnel of the ligand 
binding cleft. 

SdrG-Ligand Binding Specificity 
Based on the rSdrG(276..sw)-peptide complex structure we 
predicted which amino acids in rSdrG(27».s9ei and in the 
peptkle are important to stabilize the adhesin-ligand 
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Figure 3. Mutational Analysis of rSclrG(27M97i and 36-20 Peptide 

(A) Surface plasmon resonance (SPR) was used to analyze the interaction of wild-type and mutated versions of tSdrQ,27^ with the 31-25 
peptide Immobilized on a BIAcore sensor chip. Unes represent ( ), SdrG^^ (— ), SdrG,2W>339A; ( ), SdrG,27»s7)S338H. 

(B) Mutation of peptide 36-20 by amino acid replacement. Ala or Ser (A or S) was used to replace each residue within Fg 3(6-20) In a panel 
of synthetic peptides. The Ala or Ser replacement residues are shown in bold. 

(C) Mutant peptide Inhibition of rSdrG,27»97) binding to Fg. 50 nM rSdrGp,»»q was prelncubated with the different synthetic peptides (10 nM) 
for 1 hr at RT and then transferred to microtiter wells coated with 1 |i.g human Fg. After incubation fori hr at RT, bound SdrG was detected 
as described In the Experimental Procedures section. Peptides with residue replacements that aftected inhibition (7-1 ^ are shown in dark bars. 
(0) The rSdrG(273^ 0 tennlnus Is essential for binding to Fg-derived peptides. Surface plasmon resonance shows the binding of rSdiGga^ 
(•• ••). rSdrG|27M83) ( ) and rSdrGpTMaj (- - -) to the 31-25 peptide immobilized on a BIAcore sensor chip. 



complex. To test these predictions, we constructed two 
point mutants by site-directed mutagenesis replacing 
residues Ser338 and Asp339, that interact wtth Phellf 
and Sen 2f of the peptide (Figure 2D), with His and Ala 
respectively. We predicted that, the S338H and D339A 
mutations would not only disturb the protein-peptide 
hydrogen bonds, but also that the S338H mutation 
would introduce a larger side chain within the binding 
region, occupying the space where the peptide docks, 
thus preventing ligand binding. We tested the ability of 
the rSdrG(„;»96)S338H and rSdrG,27^D339A to bind to 
the immobilized pi -25 peptide in the BIAcore (Rgure 
3A) and to immobilized Fg in an EUSA-type assay (data 



not shown), with identical results. Both mutations com- 
pletely abolished the ability of rSdrGfj^j^gq to bind to 
human Fg. 

A closer inspection of the rSdrG(27e.59e)-peptide com- 
plex suggests that the specificity of the peptide for 
rSdi<3(?76-596) is determined by the residues PhelOf, 
Phellf, Sen2f, Ala13f, Arg14f, and GlylSf (Figure 2D). 
To further examine which residues within the Fg p-chain 
sequence (|36-20) are necessary for peptide binding to 
rSdrG(Z73^97), we synthesized a panel of mutant peptides 
(Figure 3B) replacing each residue in the native Fg 
p-chain sequence with an Ala (or Ser if Ala is present 
in the native sequence). The ability of the synthesized 



(C) Schematic representation of rSdrG^Mq-peptide interactions. The 3 strands are colored purple and yellow for the peptide and the rSdrG 
molecule, respectively. Hydrogen bonds are indicaled by dotted lines. 

(D) Stereo close-up of ligand binding site showing (1) the interactions of residues Sef338 and Asp339 with the peptide (iQ the binding pocket 
of GlylSf. 
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Figure 4. The p6-20 Peptida Is Not the Optimal SdrG anmt Ugand 
(/^ Binding of rSdiG(27M»;, to fluoresceln-labeled synthetic peptides 
quantitated by fluorescence polarization. Increasing concentrations 
of rSdrQ(27M9i) were incubated with synthetic fluoresceln-labeled 
wild-type and mutant peptides (10 nM). The calculated Ko for each 
Interaction was: rSdiG-(p6-20) [•], 379 nM; rSdrG-O6-20G1 5S) M, 
302 nM; rSdrG-[p6-20(F3)] [A], 49 nM. 

(B) Binding of SdiGpTwantounlabeled synthetic peptides quantitated 
by isothermal titration calorimetry. A20\jM solution of SdrC^27»i!B) 
was incubated wKh an increasing concentration of synthetic pep- 
tides. The calculaled Kd for each interaction was: rSdrG-(p6-20) 
m, 380 nM; iSchfG-(p6-20G1SS) m, 300 nM; rSdiG-[p6-20(F3)] [A], 

(O Representative profiles of the relative SPR responses for the 
binding of synthetic peptides to fSdiG,mw, Immobilized on a BIA- 
core sensor chip. Colors represent: (Red), p6-20(F3); (Green), p6- 
20(G15S); Plue), p6-20; (Cyan), 36-20(F11A); (Pink), 31-25 
(scrambled). 

mutant peptides to bind to rSdrG,j7M97) was tested in an 
iniiibition assay. Peptides number 7, 8, 9, 10, 11, and 
12 were the least effective in inhibiting the binding of 
rSdrG(273^97) to immobilized human Fg, indicating that 
these peptides have a lower affinity for SdrG than the 
unmodified peptide. Using surface piasmon resonance 
we confimied that peptide number 8 (F11A) binds less 
well to immobilized Fg compared to the native p6-20 
peptide and only slightly better than a pi -25 scrambled 
sequence peptide (Figure 4C). These data indicate that 
the amino acids PhelOf, Phellf, Ser12f, Ala13f, Arg14f 
and Glyl 5f, which are replaced in peptides number 7-1 2, 



si^ificantly contribute to the binding to SdrG (Figure 
3C). The peptide residues identified by this method coin- 
cides with those shown to contact rSdrG{27e-597) in the 
cocrystal structure. Interestingly, these residues include 
Arg14f and GlylSf, the thrombin cleavage site of the 
FgBp chain. 

The crystal structure presented in Figure 1 B was ex- 
pected to show the complex of rSdrG,27g.59e) with its li- 
gand, a peptide con-esponding to residues 6-20 of the 
human Fg p-chain (Rgure 2A). However, in the crystal 
structure solved, the bound peptide contains an addi- 
tional Phe residue in between position 9 (Gly9Q and 10 
(PhelOf) (Figure 2A) that is not present in the human Fg 
sequence. Therefore, the peptide present in the cocrys- 
tal is an analog of |36-20 containing three consecutive 
Phe residues [p6-20(F3)] instead of two. This additional 
Phe residue (Phe9f), along with the two Phe residues 
present in the native sequence (PhelOf and Phellf), 
contributes to the hydrophobic stacking interactions 
with rSdrG,276-596)- It is likely that the p6-20 peptide prepa- 
ration utilized in the cocrystal complex fomiation con- 
tained a contaminating peptide with an additional Phe 
residue. Since this was the "selected" peptide in the 
crystallization process, the contaminating peptide is 
predicted to have a higher affinity for SdrG compared 
to a peptide representing the native sequence. We there- 
fore synthesized the p6-20 and p6-20(F3) peptides, puri- 
fied them, and examined their binding to rSdrG(273-597) by 
fluorescence polarization (FP) spectrosct^y, Isothemial 
titration calorimetry (ITG) and surface piasmon reso- 
nance (SPR) (Rgures 4A, 48, and 40). The results of the 
FP and ITC experiments indicate that rSdrGpTs^ binds 
more tightly to the p6-20(F3) peptide compared to the 
P6-20 peptide, reflected by an almost 8-fbld lower Kd. 

Amino acid modeling at positions SerlZf, AlalSf and 
GlylSf suggests that larger side-chain residues would 
not be accommodated at these locations. Closer exami- 
nation of the binding pockets of these residues reveaiad 
that the replacement of GlylSf with Ser would provide 
addtt'ional hydrogen bonds with Gln425, Gln579, and 
Ser437 in rSdrGpre^) (Figure 2D). Thus, this variant pep- 
tide is predicted to have a higher affinity for SdrG com- 
pared to a peptide based on the wild-type Fg sequence. 
As expected, the GISfS mutation resulted in a peptide 
that exhibited Increased binding affinities when com- 
pared to the native p6-20 peptide, both in the FP and 
the ITC assays (Figure 48). Taken together, these results 
demonstrate that the human Fg residues bound by SdrG 
do not represent the amino acid sequence of the optimal 
ligand. Similariy, in the case of the S. aureus ClfA, we 
have found synthetic peptide analogs representing 
amino acid variants of its known ligand, the G temiinus 
of the Fg 7-chain that show higher affinity for the 
MSCRAMM than the native Fg sequence (M.H., unpub- 
lished dateO. Thus, stnicturally related MSCRAMMs con- 
taining repeated immunoglobulin-like folds such as 
SdrG and ClfA can accommodate linear peptides with 
a certain degree of ligand sequence variability. 

SdrG Inhibits the Thrombin Cleavage of the Fg 
Bp-Chain: a Structural Analysis 
During blood clotting, thrombin cleaves the two Aa 
chains of Fg rapidly at ArglBf and the two Bp-chains at 
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Arg14f more slowly to release fibrinopeptldes A and 
B, respectively. Although the structure of the complex 
thrombln-fibrinopeptide B has not been reported, the 
existing structural (Malkowski et al., 1997; Martin et a!., 
1 992, 1 996), biochemical and mutational (Le Bonniec et 
al., 1 991) data on complexes of thrombln-fibrinopeptide 
A and its analogs (Blomback, 1986; Scheraga, 1986; 
Lorel et at., 1990; Binnie and Lord, 1993) show that the 
N-terminal Aa chain of Fg interacts with thrombin 
through its residues P10-P4' (a8-20). In the present 
fSdrG(27(W97)-peptide complex structure, residues P7-P4' 
(P9-18) of the peptide Interact with rSdrG(j76-597)- Thus, 
we can now provide the structural explanation for the 
previously reported (Davis et al., 2001) SdrG interference 
with thrombin-mediated release of flbrinopeptide B, as 
the SdrG binding site overiaps with the thrombin cleav- 
age sits in the Fg Bp chain. 

In addition, the rSdrGp76^-peptide complex structure 
provides an explanation for why SdrG can bind the Fg 
Bp chain but not the structurally related Aa chain. In 
the N-tenninal Aoi(7-20) (DFLAEGGGVRGPRV-thrombln 
cleavage site shown in bold) the P2, PI and PI ' positions 
are occupied by residues Val, Arg, and Gly. compared 
to Ala, Arg, and Gly in the Fg Bp chain. These sets of 
residues are very similar and both easily would fit in the 
respective pockets of SdrG. However, the P5, P4, and 
P3 positions are occupied by Gly, Gly, Gly in the Aa 
chain and Phe, Phe, and Ser residues in the Bp chain, 
respectively. These substitutions would result in the 
elimination of a critical hydrogen bond [Sen 2f (OG) with 
Asp339 (OD^l and the hydrophobic interactions that 
PhelOf and Phellf make with rSdrG(z,6^, giving the 
Aa chain a much lower binding affinity for rSdrG|27»«iQ 
compared to that of the Bp chain. 

Structure of the apo-rSdrQ(27»-597) Protein 
The structure of the apo-rSdrG(j7M96) was solved by mo- 
lecular replacement methods using the structure of the 
rSdrG(27»«9q component of the peptide-MSCRAMM com- 
ply as a starting model. The apo-rSdrG(276.5gq stHJCtive 
was solved at a 3.5 A resolution, with four molecules in 
an asymmetric unit of a PI cell. Structurally, the four 
copies are very similar, but each molecule shows several 
short disordered segments. As a result, the molecular 
model stnjctures were truncated in up to six places. A 
high degree of disorder was also observed at the C 
tenninus, where 12 to 14 of the tenninal residues could 
not be modeled. 

To compare the structures of rSdrG(276-696) as an apo- 
protein and in complex with the peptide, we chose the 
apoprotein structure that showed the best resolved C 
terminus out of four molecules available in the PI cell. 
A stnictural comparison of rSdrG(276-59« as an apoprotein 
and In complex with the peptide (Figure 5A) shows an 
rms deviation of 1.1 A for 276 Ca atoms and revealed 
that the major difference between the two forms is at 
their C temiinl. In the rSdrG(27»^ pepUde-complex 
structure, the C-terminal region forms a regular p strand 
(G^ that is inserted between strands E and D of the 
N2 domain (Figure SB). In the apo-rSdrG,27»4g6) structure 
(Rgure SC), the C terminus does not interact with the 
N2 domain, it extends into the solvent region. Therefore, 
the space between the 0 and E strands in the N2 domain 



is empty, leaving the binding cleft between the N2 and 
N3 domains widely open and presumably providing easy 
access for the ligand to dock within the binding cleft. 

The disordered regions in all four molecules of apo- 
rSdrG,276-596) in the asymmetric unit are better defined 
in the rSdrG(i7»49<rP^tide complex. For example, the 
regton encompassing amino acids Lys350-Glu355 that 
is disonJered in the apo structure forms the D strand in 
the complex structure. Interacting with strand G" of the 
N2 domain through hydrogen bonds. Similarly, Phe373- 
Glu381 region Is stabilized by the hydrogen bonds 
fomied with the N2 domain G" strand. In addition, the 
residue Tyr552 of the connecting loop (Tyr545-TyiS52) 
between E' and F' strands makes key van der Waals 
contacts by stacking against Phe9af and Phellf resi- 
dues of the peptide. The structural rearrangements and 
the direct protein-ligand interactions formed upon pep- 
tide binding result in an effectively stabilized rSdrG(276-s9e) 
complex compared to Its apoform. 

A Dynamic Model for Ligand Binding 
The structural comparison of apo-rSdrG(27e^ and the 
rSdrG(27e.s96)-Pdptide complex provides the basis for a 
ligand binding model. The open conformation of the 
rSdrGcirg^ apoprotein (Figure SC) contrasts with the 
closed conformation (Figure SB) observed for rSdrG(276-59e) 
in the protein-ligand complex. In the protein-ligand com- 
plex, the C-terminal segment of the protein, which Is an 
extension of the N3 domain, "locks" on to the "docked" 
peptide by forming a cover and sequesters It by 
"latching" on to the neighboring N2 domain. Thus, pep- 
tide binding to rSdrG(276496) appeare to involve multiple 
steps: initially, the peptide docks in the cleft. This bind- 
ing event is followed by a structural rearrangement at 
the rSdrG(276^ C terminus that begins with a deviation 
in the strand direction at lleSSI . Because of this devia- 
tion, the strand crosses over the binding cleft and locks 
the peptide in place. This crossover results in the forma- 
tion of backbone hydrogen bonds between the bound 
peptide and the covering segment (lleSSI -Ser586) of the 
adhesin, securing the peptide in the binding pocket. The 
final step is the complementation of a p sheet in the N2 
domain, inserting the C-terminal strand (G") between 
strands E and D. This last latching event stabilizes the 
overall stmcture. 

Truncation Mutants Support the Binding Model 
To examine the proposed dock, lock, and latch model of 
ligand binding, we constructed two truncation mutants: 
rSdrG|273^9) contains a midablated version of the N2 G" 
C tenninus p strand, and rSdrG(273-580) is completely de- 
void of the N2 G" C terminus p strand and the connecting 
residues between the N2 G" and the N3 G' strand. We 
hypothesized that, given the importance of these C-ter- 
minal residues in ligand binding and structure stabiliza- 
tion, the rSdrG(273^<« polypeptide would show a de- 
creased ability to bind to Fg, and that rSdrG(273^ would 
be completely unable to bind to Fg. Our results show 
that, as predicted, the rSdrG(273^g) polypeptide has a 
mari<edly reduced affinity forthe ligand, and that binding 
of rSdrG,273.5ao) to the peptide is nearty null when com- 
pared to the binding level of the intact rSdrGtzrwa?) (Figure 
3D). These data strongly suggest that our predicted 




Figure 5. Backbone Representation and Comparison of the apo-rSdrGBTMn). rSdrOpTj^-Peptide Complex and apo-rClfA,22,_559i Structures 

(A) The superposition of Co bacl<bones of rSdrGjzTMiWj-peptide fllgtit green) with the apo-rSdrG|27M97) structure (gold). 

(B) Representation of the "closed" rSdrG(2;3^-peptide complex Qight green and purple respectively) and (C) the "open" apo-rSdrG{273^ confor- 



mentation is correct. 

A Family of Structurally Related, Ceil Wall-Anctiored 
Proteins from Gram-Positive Bacteria 
We have recently solved the structures of the ligand bind- 
ing domains of the S. aureus Fg binding IVISCRAMMs; 
fGlfA(22t^9) (Figure 5D) (Deivanayagam et al., 2002) and 
rClfB(20(M54) (Champion Deivanayagam, Samuel Peridns, 
Evelyn Walsh, Timothy Foster, M.H., and S.V.LN., un- 
published data). Although these adhesins bind to differ- 
ent chains in Fg, the overall domain arrangement and 
the topology of the individual domains are strikingly 
similar. The structural data for rCtf A^zzi-ssg) and rCifB,2o»^) 
are available only for the apoprotein. However, both 
apoproteins contain a putative ligand binding cleft be- 
tween the N2 and N3 domains and an empty latching 
cleft between the D and E strands of the N2 domain 
that could accommodate a complementing "guest" 
strand. In fact, in the tCmamss^ crystal structure, this 
space is occupied by the C-terminai end of a neigh- 
boring crystailographically related molecule that forms 
a G" liite strand and occupies the cleft between the 
E and D strands of the N2 domain (data not shown). 
Interestingly, the previously identified consen/ed se- 
quence motif TYTFTDYVD (McCrea et ai., 2000) consti- 
tutes the bacl( of the latching cleft in rSdrG(273^7| (strand 
D2 and a short a-helix connecting it to the E strand in 
the N2 domain shown in gray color in Figure 5B). This 
conserved motif is found in a similar location in the 
rClfA(22i.459) and rClfBiJoo^) crystal structures. At the mo- 
ment, we do not understand the precise function of this 
conserved sequence, but it is tempting to speculate 



that it is involved in the stmcturai preservation of the 
latching cleft. 

To investigate how widespread the docl<, locl<, and 
latch model might be among bacterial cell wall proteins, 
we searched for MSCRAMIVI candidates in the genomic 
sequences from five gram-positive pathogens, in the 
analyzed genomic sequences, the number of predicted 
cell wall anchored proteins varied from 8 (S. mutants) to 
41 (£ faecalis) (Table 2). Computer-generated structural 
models suggest that lgG-lil<e subdomains an'anged in 
tandem are present in 22%-45% of the predicted cell 
wall anchored proteins. Furthermore, all of the S. aureus 
proteins containing repeated lgG-lil<e subdomains have 
a TVinuWD-lilte sequence in one of the repeats. These 
include the Fg binding domains found in CifA, ClfB, 
FnbpA, and FnbpB, the collagen binding MSCRAMMs 
Cna, the bone sialoprotein binding protein Bbp and the 
SdrC-E proteins, for which ligands so far have not been 
identified. A TYTFTDYVD-like motif is also present in 
SdrG and SdrF firom S. epidermidis, and in the first 
N-terminal IgG-like subdomain of two E faecalis ceil 
wail anchored proteins, including the collagen binding 
MSCRAMM Ace (Jouko Sillanpaa. YJC, Sreedhar Nalia- 
pareddy, Barbara Munay, and M.H., unpublished dat^. 
A recent crystal structure determination of the Ace colla- 
gen binding domain suggests that this motif forms the 
base of a cleft into which a hypothetical latching strand 
could insert (K.P, Y.X., D.C., M.H., and S.V.L.N., unpub- 
lished data). Remnants of tfiis motif are also found in 
several streptococcal proteins. The structures of these 
proteins have not yet been solved, and we do not know 
if the identified remnants of the TYTFTDYVD motif signify 
a latching cleft in one of the predteted p sheets. 
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Table 2. Summary of Predicted LPXTG-Motif Cell Wall-Anchored Proteins of Gram-Positive Organisms with Repeated 
Immunoglobulin-Uke Fold 

Number of Predicted 

LPXTG-Motif Cell Wall Number with Predicted 

Organism Anchored Proteins Repeated Ig-Like Fold 

Biterococcus hecalis 41 9 

Staphylococcus aureus 22 9 

Staphylococcus epidermidis 11 5 

Streptococcus mutans 8 3 

Streptococcus pneumoniae 19 6 

Additional details and methods used to generate this table are available as Supplemental Data (See Supplemental Table S2 available at 
http://www.cell.eom/cgl/content/full/115/2/217/DC1). 



/Another structural feature conserved in the analyzed 
proteins is an amino acid pattern in their C-terminal 
latching strand sequences. Structural analysis of the 
SdrG latching sequence [SSGQGQG (58&-592)] reveals 
that it is formed by alternating small residues (SerS86, 
Gly588, GlySSO, and Gly592) with their short side chains 
facing the larger side chains (T yr and Phe) of the TYTFT 
DYVD motif present in the opposite ^ sheet. The ciystal 
structures of apo-rClfB (Champion Deivanayagam, 
Samuel Perkins, Evelyn Walsh, Timothy Foster, M.H., 
and S.V.L.N., unpublished data) and the collagen bind- 
ing MSCRAMMs Ace (K.P., Y.X., D.C., M.H., and 
S.V.L.N., unpublished data) and Cna (Yinong Zong, Y.X., 
M.H., and S.V.L.N., unpublished data) show that their 
latching sequences also exhibit a pattem of alternating 
small residues. Modeling the G terminus of the apo- 
rClfA structure (Figure 5D) into the latching cleft of its 
N2 domain reveals a "GSGSGDG" motif that could oc- 
cupy the latching cleft, following the pattem found in 
SdrG and other MSCRAMM structures. In addition, we 
identified potential latching sequences in putative 
MSCRAMM proteins from Staphylococci and Entero- 
cocci using homology modeling and sequence align- 
ment methods and found that these sequences also 
follow a pattem of alternating small residues (Table 3). 
This ot>servation, together with the conservation of a 
TYTFTDYVD-like motif in the back of the latching cleft 



suggest that the dock, lock, and latch mechanism is 
likely to represent a general mode of ligand capture by 
the A-domains of structurally related cell wall-anchored 
adhesins of gram-positive bacteria. 

The salient features of the dock, lock, and latch model 
include a ligand binding-induced redirection of the 
C-terniinal end of the N3 domain of SdrG, followed by 
an insertion of this redirected C-terminal extension be- 
tween strands E and D in the preceding N2 domain. This 
molecular rearrangement represents an intramolecular 
p sheet complementation that is, to our knowledge, 
unique. 

Intermolecular p strand complementation is not an 
unusual mechanism of protein-protein interaction. In 
this case, a strand from one protein complements a p 
sheet in the interacting protein. An example of this type 
of protein-protein association is the interaction of the 
PapD-like chaperons with pilus subunits during pill as- 
sembly in E. coli (Sauer et al. 1999; Choudhury et al., 
1 999). Recently, the ligand binding mechanism for fibro- 
nectin binding MSCRAMMs from gram-positive bacteria 
was elucidated (Schwarz-Unek et al., 2003). In this case, 
an unordered repeated motif of the MSCRAMM interacts 
with the set of type I modules of the N-terminal domain of 
fibronectin through a multiple p strand complementation 
event called a p-zipper mechanism. This intermolecular 
p strand complementation mechanism appears to be 



Table 3. Structurally Related Ligand Bin(flng Regions from the A-Domains of Gram-Positive MSCRAMMs Share Conserved Motifs 
MSCRAMM TYTFTDYVD motif Number of Latching 



SdfG(AAF72510) 
SdrF (AAF72509) 
Clf A (CAA79304) 
ClfB(CAA1211S) 
SdrC (CAA06650) 
SdrD (CAA06651) 
SdrE (CAA06652) 
Bbp (CAB75732) 
FnbpA(CAA65106) 
FnbpB (CAA44726) 
Cna (Q53654) 
Ace (AAG23954) 



TYTFTDYVD 
TYTFTNYVD 
lYTFTDYVN 
TFVFTDYVN 
TYTFTNYVD 
TYTFTDYVD 
TYTFTDYVD 
TYKFTDYVD 
RYTFTNDIE 
RYTFKEYVQ 
TrrFNDKVE 
VFIYKDHW 



GGGDGTV 
VWTVSV 
NKANGNE 
NNAQGOG 
ANAGIEG 



'Residues in between the TYTFTDYVD nwtif and the latching sequence. 

"Alternate residues that are projected into the region between the two p-sheets and face the TYTFTDYVD motif are shown in bold. 
°Latching sequence identified method: (la) Crystal structure-this work, (lb) Crystal stnictures of A-region of collagen binding MSCRAMMS 
Ace, Cna and Fg- bindbig MSCRAMM ClfB (Champton Deivan^agam, YX, Timothy Foster, M.H., and S.V.L.N., unpublished datsq K.P., Y.X., 
D.C., M.H., S.V.L.N., uivubiished data; Yinong Zong, YX, M.H., and S.V.LN., impublished data). (^ Modeling the C-temrinus of rClfA crystal 



(4) Pi 



unique to fibronectin binding MSCRAMMs. The unor- 
dered repeats are so far only found in the C-terminal 
half of fibronectin binding MSCRAMMs and the N-ter- 
minal type I modules of fibronectin are, to this point, the 
only tigand that accommodates the MSCRAMM unor- 
dered repeats. 

The dock, lock, and latch nradel describes a bind- 
ing mechanism for the N-tenninal A domains of the 
MSCRAMMs. These domains are composed of two or 
three IgG-like folds and conserved among a large family 
of related putative MSCRAMMs. In contrast to fibronec- 
tin, the ligands that bind to the IgG-like A-domains are 
linear short peptide sequences. This ligand group is 
large (and growing) and includes fibrinogen, elastin, ker- 
atin, bone sialoprotein, collagen, and laminin. 

Conclusion 

The structure of the S. epidermidis adhesin SdrG in 
complex with a Fg-derived peptide presented here pro- 
vides the atomic details of the interactions between a 
staphylococcal MSCRAMM and its host ligand mole- 
cule. Analysis of the crystal structures of the SdrG(27(.s9e)- 
peptide complex- and the apo-SdrG|276^ allowed us to 
propose the dock, lock, and latch mechanism presented 
in this report. 

S. epidermidis is an opportunistic pathogen. A com- 
mon Inhabitant of human skin and mucosal membranes, 
S. epidermidis has emerged as a major nosocomial 
pathogen associated with infected implanted medteal 
devtoes (Huebner and GokJman, 1999) such as pace- 
makers, mechanical heart valves, artificial joints, and 
catheters (Costerton et al., 1999; Donlan, 2001). In bio- 
material-centered infections, staphylococci colonize the 
surfaces of Implanted biomateriais that are coated with 
host proteins such as Fg. These surfaces are often ex- 
posed to constant fluid shear forces. Therefore, strong 
interactions between the pathogen and the biomaterial 
are essential for bacterial adhesion and colonization. 
The now proposed dock, lock, and latch model of SdrG 
binding to Fg provkles the molecular basis for the fbrnoa- 
tion of a stable adhesin-iigand complex. This model 
describes the structural changes that stabilize the over- 
all MSCRAMM-ligand complex, and would tightly secure 
the ligand In place under fluid shear forces. In addition, 
the presented structure reveals how SdrG blocks the 
ttirombin cleavage site in Fg Bp chain, inhibiting the 
release of fibrinopeptMe B. It is possible that staphylo- 
cocci express proteins that bind to Fg to prevent the 
release of chemotactic elements such as fibrinopeptide 
B, thus reducing the influx of phagocytic neutrophils 
to the infection site and enhancing the probability of 
bacterial survival in the host Surprisingly, we found that 
the native Fg p(6-20) sequence is not the optimal se- 
quence for SdrG binding, suggesting that multiple li- 
gands could be accommodated in the MSCRAMM bind- 
ing region. 

We currently have positively identified the dock, lock, 
and latch model only for the interaction of SdrG with 
Fg. However, we have solved additional structures of 
related ligand binding domains, such as those from Clf A 
(Deivanayagam et al., 2002), ClfB (Champion Deivanaya- 
gam, Samuel Peridns, Evelyn Walsh, Timothy Foster, 



M.H.. and S.V.LN.. unpublished datsO. Ace (K.P., Y.X., 
D.C., M.H., and S.V.LN., unpublished data), and Cna 
(Yinong Zong, Y.X., M.H., S.V.LN., unpublished data). 
All of these MSCRAMMs contain a cleft in one of the IgG- 
like folds that could accommodate a latching strand. A 
survey of the available genomes of five pathogenic 
gram-positive bacteria revealed that all organisms en- 
code putative cell wall anchored proteins predicted to 
contain repeated, in tandem IgG-like folded domains. 
Furthermore, variants of the TYTFTDYVD motif, which 
appears to be associated with the latching cleft and 
putative latching sequences are also present in some 
of these cell wall anchored proteins from gram-positive 
bacteria. We therefore propose that the dock, lock, and 
latch model of ligand binding is a common theme among 
these proteins. 

Purificatioii, Crystallization, and Data Collsction 
The rSdrGcMH) polypeptida was expressed in £ coH and purified 
bv immobiiized metal chelate affinity chromatography, followed t>y 
anion-exchange and gel-filtration chromatography, as previously 
described (Davis et al., 2001). The rSdrCI|273^ protein was mixed 
with excess peptide (Peptide: protein ratio of 1.5 or greater) and 
screened for crystallization conditions. Crystals were obtained by 
hanging drop method using 50% PEG 2000 Mono Methyl Ether 
(MME), 0.2 M NaCI and 50 mm buffer (Tris, HEPES, MES, Pipes, 
Cacodylate) between pH 5.5-7.5. The crystals belong to the ortho- 
rhombic P2i2,2,spacegroupand diffracted to 1 .86 A. The crystals of 
apo-rSdrG(276-597) were obtained at similar conditions, and, however, 
belong to the triolinio PI space group. The X-ray diffraction data 
for both apo- and complex crystals were collected at 100 K using 
in house X-ray source. DENZO/SCALEPACK package (OtwInowskI 
and Minor, 1 997) was used for diffraction data processing. 

Phase Determination, Model Building, and Refinement 
rSilrGfat^Popiide Complex 

The iSdrG(278^-peptlde complex was solved by multiple Isomor- 
phous replacement method using two derivatives such as Trimethyl 
Lead acetate (TMLA) and platinum. SOLVE (Tenwilligerand Berend- 
zen, 1999) program was used for the refinement of heavy atom sites 
cind subsequent phase calculations. The initial solvent flattened 
electron density map revealed excellent density allowing the con- 
struction of a molecular model for both copies of the molecule In 
the asynmetric unit. QUANTA (Molecular Simulations, Inc) was used 
for tracing and model building and CNS (Brunger, 1998) for refine- 
ment The refinements were monitored throughout using free R val- 
ues and the model was refined to final R factor/R^^ of 0.209/0.221 
using all reflections from 20 to 1.86 A. The side chain density for 
few residues (in complex I: Lys320, Lys379, Asp474, Glu475, LysS44, 
LeuS94, and ProS96 and in complex II: Lys379, Gln519, Lys549, 
GluS67, TyrSeS, and ljeuS94) were not clear and they were modified 
to alanines. Similarly, the peptide residue Phe9af in complex I and 
Arg17f in complex ii were tnmcated to alanines due to lack of side 
chain density, i^inai refinement statistics are summarized In Table 1 . 

The stnictwe of peptide free rSdrGigMni was solved by the molecu- 
lar replacement method using Amore (Navaza, 1994), refined with 
CNS and the model built with QUANTA. The ciystal structure of 
rSdrG(:7E^537)-peptide complex (without the twund peptide) was used 
as the starting model. The first round of refinement, using 20-3.5 A 
resolution data reduced R factor to 0.287 and R^. to 0.345. A few 
cycles of further refinement and rebuilding lowared the R factor 
and Rta. to 0.250 and 0.319 respecUvely. A summary of refinement 
statistics is presented in Table 1. 

The synthesis and purification procedures of the peptides were 
described previously (Davis et al., 2001). For the following peptides 
the amino acid residue numbers are given and the sequence follows: 
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peptide pi -25, composed of first 25 residues of the N terminus of 
the Bp chain of Fg (QQVNONEEGFFSARGHRPUJKKRE^; peptide 
P6-20 (NEEGFFSARQHRPU3); peptide p6-20(F3) (NEEGFFFSARG 
HRPLD); a scrambled version of peptide pi -25 (FSERKOLHQGEGN 



The pl-2S Fg peptide was diluted in 10 mM fonnate [pH 3.5] and 
immobilized on a CMS sensorchip (BiAcore, Uppsala, Sweden). The 
immobilization level was 430 RU. The recombinant proteins were 
injected on the pepUde-coated surface at 25 iil/min Gn HBS, 10 mM 
HEPES tpH 7.5], ISO mM NaCI, and 25-0 and the binding and 
dissociation was registered in a BIAcore 3000 instrument The re- 
combinant SdrG,a»^ was diluted to SO ij^g/ml in 1 0 mM Na Acetate 
[pH 4.0] and immobilized In a CMS sensorchip. The Immobilization 
level was 1S60 RU. Tho ^ntheUc peptides, diluted to 0.1 (iM in 
HBS. were injected on the rSdrO-coated surfeice at 150 tilAnin and 
IS registered in a BIAcore 3000 in- 



The binding of rSdrG to immobilized Fg was analyzed in a solid 
phase assay as previously described (Davis et al., 2001 ). TTie binding 
ofrSdrGtofli 



with a VP-nC microcalorimeter (MicroCal Inc., Northampton, MA, 
US^ at 25°C. In atypical experiment the cell contained 20 m.M rSdrG 
and the syringe contained 200 ^.M peptide. Both solutions were in 
HBS buffer (10 mM HEPES, 150 mM NaCI, [pH 7.4}) aid were de- 
gassed at 25°C for 20-30 mln. The titration was performed as follows: 
one preliminary injection of 5 |jlI and 29 Injections of 10 (lI with an 
Injection speed of 0.5 |i.l/sec. The stining speed was 540 rpm and 
the delay time between the Injections was 5 mln. To take into ac- 
count the heats of dilution, two blank titiations were performed: one 
injecting peptide into tMjffer and another irijecting buffer into the 
rSdrG solution. The averaged heats of dilutton were subtracted from 
the main experiment. Data were analyzed using MicroCal Origin 
software (version 5.0) fitting them to a single binding site model. 

Identification of Putative Cell Wall-Anchored Proteins 
of Gram-Positive Bacteria 

Genome sequences used were: E. faeculis V583 (www.tigr.org); S. 
aureus COL (www.tigr.org); N31 5 and MuSO (www.ncbi.nlm.nih.got^; 
S. aureus MSSR (www.sanger.ac.uk); S. epidermidis RP62A (www. 
tlgr.org); S. mutans UA159 (www.ouhsc.edu); and S. pneumoniae 
ATCC BAA-334 (www.tigr.org). For genomes that have not been 
annotated, a custom-designed algorithm was used to predict open 
reading frames (ORFs) and translate the ORFs Into amino acid se- 
quences. Glimmer 2.0 obtained from TIGR was used for predicting 
the open reading frames. The open reading frames were translated 
Into amino acid sequences using an In house translation program. 
The amino acid sequences were formatted into Blast searchable 
database using fonmatdb from NCBI. Annotated genomes were for- 
matted using formatdb. U>xrG-motif containing proteins were iden- 
tified using PHI-blast, obtained from NCBI. The PHI-blast search 
uses a degenerate LPXTG pattern L-P-X-|TSA]-[GANS], X being any 
amino acid. The output sequences were further analyzed to select 
proteins containing typical features of LPXTG-motif containing cell 
wail-anchored proteins. These Include a signal peptide at tfie N ter- 
minus (predicted using Si^ialP server at www.cbs.dtu.dk/servK»s/ 
SignalP/), the LPXTG-motIf close to the C terminus (visual examina- 
tion) followed by a hydrophobic transmembrane segment (predicted 
using TMHMM server at www.cbs.dtu.dk/servicesnUHMM-2.0/) 
and several positively charged residues at tlie C terminus (visual 
examination). For annotated genomes, the term "LPKTG" or "cell 
wall" was used in searching tlie genomes for proteins annotated as 
cell wall anchored proteins. A dc 
be described elsewhere. 



Homology Modeling 

Homology modeling was carried out using the WHAT IF server 
(http://www.cmbi.kun.nl/gv/serversWIWWWI/). Initially, the pair- 
wise sequence alignment of template and target sequences was 



(httpy/ipr.georgetown.edu/pirwww/search/pairwlse.fttmQ. The aligned 
sequences, along with the template coordinate, were submitted to 
the WHAT IF server and the homology model were obtained. The 
insertions in the loop regions were modeled using the server CODA 
(fittp://www-cryst.bioc.cam.ac.uk/coda/). The short contacts side 
chain contacts were manually adjusted and the final model was 
energy minimized using CNS (Brunger, 1998). The stereochemistry 
of the model was evaluated with the program PROCHECK (Uskow- 
skl et al., 1 993) and the residue environment was analyzed with the 
VERIFY_3D program (Luthy et al., 1 992). Qualitative statisttes of the 
models are presented in Supplementai Table SI available at mp-J/ 
www.cell.com/cgi/content/full/1 1 5/2/21 7/DC1 . 



Details of bacterial strains, polymerase chain reaction amplification 
of the sdrG gene fragments, site-directed mutagenesis and purifica- 
tion of recombinant proteins are provided in the Supplemental Data, 
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Accession Numbers 

Coordinates have been deposited with the Protein Data Bank (ac- 
cession numbers AH\7 and 1R19). 



